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Process for preparing optically active amino acids 
using a whole-cell catalyst 

The invention describes a process for preparing 
1 optically active L-a-amino acids, in particular the 
present invention describes a process for preparing 
compounds of the general formula (I) 

R^COgH 
(I), 

in which R is alkyl, in particular a space-filling 
branched alkyl group which exhibits a tertiary c atom 
and which possesses 5-10 C atoms, for example tert- 
butyl, and substituted alkyl, or salts which are 
derived therefrom. 

Optically active L-a-amino acids are used for preparing 
a number of valuable compounds. For example, these 
compounds function as intermediates in the production 
of pharmaceuticals. L-tert-Leucine, which can be found 
as a structural element in a number of pharmaceutical 
active compounds and is consecjuently required as an 
intermediate for synthesizing the corresponding 
pharmaceutical active compounds, is a particularly 
valuable representative of this product class. 

A. S. Bommarius et al (j moI n=,t- -a ^ 

a-L., (u . Mol. Cat. B: Enzymatic 

1998, 5, 1-11) provides examples of uses of L-tert- 
leucine as a building block for pharmaceutical active 
compounds . 

Using a leucine dehydrogenase and a formate 
dehydrogenase from Candida boidinii to enzymically 
reduce 2-ketocarboxylic acids while regenerating 
cofactor in situ constitutes an industrially 
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established method for preparing optically active 
L-a-amino acids. In particular, this route is suitable 
for preparing the non-proteinogenic amino acid L-tert- 
leucine, which is produced on the ton scale using this 
biocatalytic method. The method is described in detail 
in the literature (EP0692538; U. Kragl, D. Vasic-Racki, 
C. Wandrey, Bioprocess Engineering 1996, 14, 291-297; 
A. S. Bommarius, M. Schwarm, K. Drauz, J. Mol. Cat. B: 
Enzymatic 1998, 5, 1-11; G. Krix, A. S.: Bommarius, 
K. Kottenhahn, M. Schwarm, M.-R. Kula, J. Biotechnol. 
1997, 53, 29-39, A. Liese, C. Wandrey, A. Liese, 
K. Seelbach, C. Wandrey, Industrial Biotransformations, 
Wiley-VCH, Weinheim, 2000, p. 12 5f. and 

A. S. Bommarius, K. Drauz, W. Hummel, M. -R. Kula, 
C. Wandrey, Biocatalysis 1994, 10, 37-47. In addition, 
a general review is provided in A. S. Bommarius in: 
Enzyme Catalysis in Organic Synthesis (Eds.: K. Drauz 
and H. Waldmann) , Volume 2, 2nd edition, Wiley-VCH, 
Weinheim, 2003, chapter 15.3, p. 1047f.). 
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0 

R^COaH 
(») 



Scheme 1. Preparation of L-tert-leucine using isolated 
enzymes and added cof actor (taking as an example an 
NAD^-dependent amino acid dehydrogenase and a formate 
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dehydrogenase for regenerating cofactor) 

Typical quantities of NADh- cofactor which are used, and 
which have to be added, are described, for example, in 
EP0692538 and are in the range of from 0.0008 
equivalents to 0.02 equivalents. In addition, G. Krix 
et al. (J. Biotechnol. 1997, 53, 29-39) describe the 
preparation of ( S ) -neopentylglycine in industrial batch 
sizes using an NAD"" cofactor quantity of 0.003 
equivalents. Typical substrate concentrations in 
EP0692538 are 100-250 mM. A. Liese et al . (Industrial 
Biotransformations, Wiley-VCH, Weinheim, 2 000, 

p. 125f.) describe the preparation of L- tert-leucine 
using a substrate concentration of 0.5 M and with a 
yield of 74%. G. Krix et al . (J. Biotechnol. 1997, 53, 
29-39) also describe the performance of reductive 
aminations using isolated leucine dehydrogenase and 
formate dehydrogenase enzymes at substrate 
concentrations of from 0.5 to 1 M. 

The high turnovers and outstanding enantioselectivi- 
ties, which are > 99% ee and consequently help to meet 
the strict quality demands placed on pharmaceutical 
intermediates, are advantageous features of this 
method. It is also possible to operate at high 
substrate concentrations, something which is an 
important aspect particularly from the industrial point 
of view. 



However, a disadvantage of the previous method is, in 
the first place, the requirement for isolated enzymes. 
These latter are used, in particular, in purified form, 
with this being accompanied by an increase in the share 
of the costs due to the biocatalyst. Because of the 
high enzyme costs resulting from this, it is necessary 
to recycle the enzymes many times in order to obtain a 
favorable process economy, in particular low enzyme 
costs. In addition to the long running times of these 
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recycling procedures, which are advantageously carried 
out continuously, the relatively small reaction volumes 
per batch which result from this are disadvantageous. 

Another disadvantage is the requirement for cofactor 
which is added in the reaction. While these cof actors 
are added catalytically in orders of size of approx. 
0.001 equivalents, they nevertheless represent, because 
of their high price, a considerable cost factor even at 
catalytic quantities . 

A process in which the necessity of using isolated 
enzymes and of adding cofactor is dispensed with, or 
the addition of cofactor is kept to a minimum, and the 
synthesis nevertheless proceeds with a high turnover 
rate, high enantioselectivity and high volumetric 
productivity, would therefore be desirable. In this 
way, it would be possible to lower enzyme costs 
considerably and save on cofactor costs, and 
consequently increase the economy of the process. 

Soda et al . describe the use of a whole-cell catalyst, 
comprising a leucine dehydrogenase and a bacterial 
formate dehydrogenase, in the reductive amination of, 
inter alia, branched-chain a-ketocarboxylic acids such 
as L-tert-leucine (Appl . Environm. Microbiology 1997, 
63, 4551-4656) . This publication explicitly points out 
that the enzymes which are required in the reductive 
amination can be used in the form of a whole-cell 
catalyst, in particular E. coli, as live or resting 
cells, which comprises these enzymes. However, if 
preference were to be given to taking advantage of the 
intracellular pool of NAD"" in E.coli, for the purpose 
of avoiding having to add the NAD"", the final 
concentration of product would then be restricted to 
about 0.3 M. This is not adequate for industrial 
applications . 
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The object of the present invention was therefore to 
specify another process for preparing L-a-amino acids 
which operates enzymically and which can be carried out 
advantageously on an industrial scale. The process 
should, in particular, be superior to the processes of 
the prior art with the abovedescribed aspects and 
should make it possible to produce the desired products 
advantageously from the point of view of process 
economics (in particular space- time yield) , 

These objects, and other objects which are not 
specified in more detail but which ensue from the prior 
art in an obvious manner, are achieved by a process 
having the features of the present claim 1. Claims 2 to 
9 are directed preferred embodiments of the present 
process - 

Said object is achieved, in a manner which is extremely 
elegant and surprising but nonetheless advantageous for 
that, by, in a process for preparing enantiomerically 
enriched L-a-amino acids or their salts by reacting the 
corresponding 2-ketocarboxylic acid with an ammonium 
ion donor in the presence of a whole-cell catalyst 
which comprises a cloned gene encoding a cofactor- 
dependent amino acid dehydrogenase and a cloned gene 
encoding an enzyme which regenerates the cof actor, 
metering, at a total input of substrate per reaction 
volume of > 500 mM, the addition of the substrate such 
that the stationary concentration of 2-ketocarboxylic 
acid is less than 500 mM and the external addition of 
cofactor, based on the total input of substrate, 
corresponds to < 0.0001 equivalents. 

Surprisingly, it is possible, for example by using the 
whole-cell catalyst while at the same time metering in 
the substrate, to dispense with any addition of the 
expensive cofactor or, by means of making a minimal 
external addition (< 0.0001 equivalents), to keep its 
concentration in a low range, with this helping to save 
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on process input costs. By contrast, without this 
metering technology and when initially introducing 
substrate quantities per reaction volumes of > 500 mM, 
the reductive amination using the whole-cell catalyst 
only succeeds when relatively large quantities of the 
NAD+ cof actor are added. In the absence of the latter, 
the concentration only proceeds unsatisfactorily (see 
comparative example "synthesis example 1", initial 
substrate quantity per reaction volumes 900 mM - final 
turnover 2 5%) . It is consequently only by using the 
process according to the invention (see synthesis 
examples 2 to 4 ) that it is possible to be able to 
almost completely dispense with the external addition 
of the cof actor even when carrying out the synthesis 
with relatively high total turnover quantities per 
reaction volumes and consequently under conditions 
which make sense from the point of process economics. 

In a preferred embodiment, the expensive cof actor is 
therefore only added in quantities which are such that 
a concentration of preferably < 0.00005 equivalents, 
extremely preferably < 0.00001 equivalents, based on 
the substrate, is maintained. Very particular 
preference is given to an embodiment in which no 
cof actor is added externally to the reaction mixture. 
In this case, therefore, no addition of the cofactors 
(e.g. NAD{H)) need take place at all, something which 
it was not possible to deduce in an obvious manner from 
the prior art. 

Within the context of the reaction under consideration, 
the skilled person is free to choose the genes which 
encode a cof actor-dependent amino acid dehydrogenase 
and an enzyme which regenerates the cof actor, which 
genes are to be expressed by the whole-cell catalyst, 
as host organism. He will lean toward enzymes which are 
known from the prior art. 
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With regard to the amino acid dehydrogenase, suitable 
enzymes are, in particular, those which are selected 
from the group consisting of leucine dehydrogenases 
(US585403 5) and phenylalanine dehydrogenases 

(US5416019) - Amino acid dehydrogenases which have 
proved to be suitable are, in particular, the leucine 
dehydrogenases, with the leucine dehydrogenases from 
Bacillus strains, and, in this case, in particular, 
from Bacillus sphaericus, Bacillus cereus (Seq. ID 
No. 5) and Bacillus stearothermophilus being 
particularly suitable . Cof actor-regenerating enzymes 
which can be taken into consideration are those 
selected from the group consisting of formate 
dehydrogenases (EP1295937), malate dehydrogenases 
(PCT/EP/03/08631) , lactate dehydrogenases and glucose 
dehydrogenases (the latter in A. Bommarius in: Enzyme 
Catalysis in Organic Synthesis (eds.: K, Drauz and 
H. Waldmann) , Volume III, Wiley-VCH, Weinheim, 2002, 
chapter 15-3). The use of a formate dehydrogenase from 
Candida boidinii or mutants resulting therefrom 
(EP1295937; Seq. ID No. 7), while employing a formate- 
containing component as substrate, has proved to be 
very particularly preferred. 

In this connection, a whole-cell catalyst which 
comprises a leucine dehydrogenase and a formate 
dehydrogenase from Candida boidinii or mutants derived 
therefrom is particularly suitable. 

The substrate spectrum which is converted by the whole- 
cell catalyst differs depending on the amino acid 
dehydrogenase which is employed. While the leucine 
dehydrogenase is more suitable for linear and branched 
aliphatically substituted 2-ketocarboxylic acids, the 
phenylalanine dehydrogenase is preferably used for 
aromatic substituted substrates. With regard to the use 
of leucine dehydrogenase in the whole-cell catalyst, it 
is preferably possible to employ and convert substrates 
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of the general formula (II) possessing an aliphatic 
radical R 



O 



COOH 

(II) 



Substrates which possess bulky aliphatic radicals as R 
are particularly suitable. These R radicals are 
primarily those selected from the group consisting of 
1-adamantyl, neopentyl and tert-butyl. For this reason, 
preference is given to a process in which use is made 
of 2-ketocarboxylic acids, or salts resulting 
therefrom, which yield amino acids of the general 
formula (I) 



R 

A. 



H^N COOH 

(I) 



in which R is alkyl, in particular a space-filling 
branched alkyl group which exhibits a tertiary C atom 
and possesses 5-10 C atoms, for example tert-butyl, and 
substituted alkyls. 

In principle, the skilled person is free to choose the 
manner in which he carries out the process according to 
the invention. In this connection, he will lean toward 
processes which are known from the prior art. These 
processes can be continuous or discontinuous. It is 
advantageous to meter the addition of the substrate in 
accordance with a fed batch process [see, for example, 
synthesis examples 2 and 4] or by continuously adding 
it [see, for example, synthesis example 3] . In both 
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process variants, the substrate is added such that the 
stationary concentration of substrate is less than 
500 iriM. 

It has turned out to be advantageous to use the 2-keto- 
carboxylic acid employed as substrate at a maximum 
stationary concentration of less than 450 mM, and very 
particularly preferably of less than 400 mM, during the 
reaction . 

In the fed batch process, the substrate is added in 
portions, after given units of time and preferably as a 
substrate solution. The number of the substrate 
portions which are added is preferably between 3 and 
15, very preferably between 5 and 9. The concentration 
of the added substrate solution should preferably be 
set high enough to achieve a total input of substrate 
per reaction volume which is as high as possible , 
Synthesis examples 2 and 4 provide examples of this fed 
batch process variant. In the case of the continuous 
process variant, the substrate is added continuously 
over a given period of time, preferably at a constant 
metering rate, with the substrate preferably being 
added in the form of a substrate solution. Synthesis 
example 3 provides an example of this continuous 
process variant. 

All known cells are suitable for use as the whole-cell 
catalyst which comprises an amino acid dehydrogenase 
and an enzyme which is capable of regenerating the 
cof actor. Microorganisms which may be mentioned in this 
regard are organisms such as yeasts, such as Hansenula 
polymorpha, Pichia sp. , Saccharomyces cerevisiae, 
prokaryotes, such as E.coli and Bacillus subtilis, or 
eukaryotes, such as mammalian cells, insect cells or 
plant cells. The methods for cloning are well-known to 
the skilled person (Sambrook, J.; Fritsch, E. F. and 
Maniatis, T. (1989), Molecular cloning: a laboratory 



040055 AM 



- 10 - 



manual, 2nd ed. , Cold Spring Harbor Laboratory Press, 
New York). Preference is given to using E.coli strains 
for this purpose. Those which are very particularly 
preferred are: E.coli XLl Blue, NM 522, JMlOl, JM109, 
JM105, RRl, DH5a, TOP 10-, HBlOl, BL21 codon plus, BL21 
(DEB) codon plus, BL21, BL21 (DE3), MM294. Plasmids 
which can preferably be used to clone the gene 
construct containing the nucleic acid according to the 
invention into the host organism are likewise known to 
the skilled person (see also PCT/EP03 /07148 ; see 
below) . 

Suitable plasmids or vectors are, in principle, all the 
versions which are available to the skilled person for 
this purpose. These plasmids and vectors can be found, 
for example, in Studier and coworkers (Studier, W. F.; 
Rosenberg A. H.; Dunn J. J.; Dubendroff J. W. ; (1990), 
Use of the T7 RNA polymerase to direct expression of 
cloned genes. Methods Enzymol . 185, 61-89) or the 
brochures provided by the companies Novagen, Promega, 
New England Biolabs, Clontech or Gibco BRL. Other 
preferred plasmids and vectors can be found in: 
Glover, D. M. (1985), DNA cloning: a practical 
approach. Vol. I-III, IRL Press Ltd., Oxford; 
Rodriguez, R. L. and Denhardt, D. T. (eds) (1988), 
Vectors: a survey of molecular cloning vectors and 
their uses, 179-204, Butterworth, Stoneham; 

Goeddel, D. V. (1990), Systems for heterologous gene 
expression. Methods Enzymol. 185, 3-7; Sambrook, J.; 
Fritsch, E. F. and Maniatis, T. (1989), Molecular 
cloning: a laboratory manual, 2nd ed. , Cold Spring 
Harbor Laboratory Press, New York. Plasmids which can 
very preferably be used to clone the gene constructs 
containing the nucleic acid sequences under 
consideration into the host organism are, or are based 
on: PUC18/19 (Roche Biochemicals) , pKK-177-3H (Roche 
Biochemicals) , pBTac2 (Roche Biochemicals) , pKK223-3 
(Amersham Pharmacia Biotech) , pKK-23 3-3 (Stratagene or 
pET (Novagen) . 
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In another embodiment of the process according to the 
invention, before it is used, the whole-cell catalyst 
is preferably pretreated such that the permeability of 
the cell membrane for the substrates and products is 
increased as compared with the intact system. In this 
connection, particular preference is given to a process 
in which the whole-cell catalyst is, for example, 
pretreated by being frozen and/or by being treated with 
toluene. The essential features of the process 
according to the invention are shown in scheme 2 . 

The substrates can be employed at an extraordinarily 
high concentration when using the present process, as 
has also been described in the prior art when using the 
individual enzymes. In the present case, it is 
advantageous to employ the 2-ketocarboxylic acid at a 
concentration of greater than 500 mM. It is also 
preferred to introduce the substrate into the reaction 
at concentrations of greater than 800 mM, preferably 
greater than 900 mM and very particularly preferably 
greater than 1000 mM. However, in the case of this 
embodiment, it is essential to add cof actor to the 
reaction mixture in order to achieve corresponding 
turnover rates. 

If, however, it is wished, despite a high space- time 
yield being demanded, to use the whole-cell catalyst 
such that it does not become necessary to add the 
expensive cofactor externally, or only necessary to 
make an extremely small external addition of less than 
0.0001 equivalents, the skilled person can then 
surprisingly achieve this by the metering, in 
accordance with the invention, of the substrate. 

In the case of the present reaction, the procedure is 
preferably that the whole-cell catalyst and the 
ammonium ion donor are initially introduced in water. 
Any compound which is suitable to the skilled person 
for this purpose can be used as the ammonium ion donor. 
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In particular, these animonium ion donors are compounds 
which are selected from the group consisting of typical 
ammonium salts. Very particular preference is given to 
using ammoniiim formate when a formate dehydrogenase is 
selected as the cof actor regeneration system. The 
reaction can be depicted very clearly by means of the 
following scheme 2 . 



whole-cell catalyst: 



o 

X 

R-^COaH 
(II) 

total substrate concentration 
>500 nM; metered addition of 
the substrate 




amino acid dehydrogenase 
formate dehydrogenase, 
NAD® 




ammonium formate 
no addition of oof actor 
or cof actor addition < 0.0001 equlv. 



R^COgH 
NH2 
(1) 



Scheme 2. Principle of the reaction in the whole-cell 
catalyst process according to the invention (taking as 
an example an NAD""- dependent amino acid dehydrogenase 
and a formate dehydrogenase for regenerating cof actor) 

If other dehydrogenases are used instead of the leucine 
dehydrogenase, the conditions under which the enzyme in 
question functions optimally can be found in the prior 
art. The reader is referred to US5416019 and Galkin et 
al. (Appl. Environ- Microbiol- 1997, 63, 4651) with 
regard to using a phenylalanine dehydrogenase - 

With regard to the cof actor-regenerating enzymes and 
the conditions to be established, reference can be made 
to EP1295937 (formate dehydrogenase), PCT/EP/03/08631 
(malate dehydrogenase) and Enzyme Catalysis in Organic 
Synthesis (Eds.: K. Drauz and H- Waldmann) , Volume III, 
Wiley-VCH, Weinheim, 2 0 02 (glucose dehydrogenase) , and 
to the literature cited therein. 
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The reaction mixture is worked up using methods known 
to the skilled person. In the batch process, the 
biomass can be readily separated from the product by 
means of filtration or centrifugation. The amino acid 
5 which is obtained can then be isolated using customary 
methods (ion exchange chromatography, crystallization) . 

However, the present process can also be carried out 
continuously. For this, the reaction is carried out in 

10 what is termed an enzyme-membrane reactor in which high 
molecular weight substances, i.e. the biomass, are 
retained behind an ultrafiltration membrane and low 
molecular weight substances, such as amino acids which 
have been produced, are able to pass through the 

15 membrane- A procedure of this nature has already been 
described several times in the prior art (Wandrey et 
al. in year-book 1998, Verf ahrenstechnik und 
Chemieingenieurwesen [Process technology and chemical 
engineering], VDI, p. 151ff; Kragl et al . , Angew. Chem. 

20 1996, 6, 684). 

The process, which is presented here, for preparing 
amino acids, which are, in particular, bulky, can very 
readily be established on a commercial scale on account 
25 of its advantages. The surprising fact that the 

addition, which is necessary in the case of the 
reaction under consideration, of a cofactor can be 
dispensed with in the process according to the 
invention, as well as the advantages arising from the 
30 fact that the whole-cell catalysts are easy to manage, 
constitute the non-obvious superiority of the present 
invention over the methods of the prior art. 

Furthermore, it can be regarded as being surprising 
35 that the influence of undesirable metabolic /physio- 
logical functions is of no importance when using the 
whole-cell catalyst. Both aspects help, in an 
extraordinarily comprehensive manner, to lower the 
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process costs entailed in preparing the L-a-amino 
acids . 

It is furthermore surprising that, despite permeabili- 
zation of the cell wall and the possibility, associated 
therewith, of the cofactor present in the cells 
escaping, a negative impairment of the reaction which 
might be expected, for example as a result of the 
turnover being decreased, is not observed. 

Within the context of the invention, optically enriched 
(enantiomerically enriched, enantiomer enriched, 
enantiomerically pure) compounds are understood as 
meaning the presence of one optical antipode at > 50 
mol% when mixed with the other. 

The whole-cell catalyst is understood as meaning a 
microorganism which comprises cloned genes which encode 
enzymes which are at least able to catalyze two 
consecutive steps in the transformation of an organo- 
chemical compound. In this regard, and with regard to 
the general preparation methods (matching the enzyme 
expression with regard to the turnover rates) , the 
reader is referred to EP1216304. 

According to the invention, alkyl is understood as 
meaning a (Ci-Cis) -alkyl radical. This encompasses 
linear and arbitrarily branched radicals of this 
nature. It includes, in particular, methyl, ethyl, 

1- propyl, 2 -propyl, 1-n-butyl, 2-n-butyl, 1- or 

2- isobutyl, 1- or 2-sec-butyl, tert-butyl, etc. The 
radicals can be substituted once or more than once by 
(Ci-Ca) -heteroalkyl radicals or radicals such as OH, SH, 
Hal and NH2- Heteroalkyl radicals are understood as 
meaning, in particular, an alkyl radical as described 
above which possesses from 1 to 8 C atoms and which 
contains heteroatoms, such as O, S or N in its chain or 
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which is bonded, by way of these heteroatoms , to the 
molecule under consideration. 

External addition of cof actor means that this quantity 
of cofactor is added artificially to the reaction 
mixture. This quantity is to be seen as being in 
addition to the quantity of cofactor which is already 
inherently introduced into the reaction mixture by the 
whole-cell catalyst. 

It goes without saying that the 2 -ketocarboxylic acid 
which is used in the reaction is present in the 
reaction mixture in dissociated form. This foirm can be 
obtained either by using the ketocarboxylic acid and 
adjusting the pH correspondingly or by adding the salts 
of the ketocarboxylic acids. Both forms are included 
here analogously and in accordance with the invention. 

The term total substrate concentration stands for the 
total input of substrate per reaction volume. 
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Figures : 

Fig. 1 - PAM3.25 (Seq. ID No. 9): 
Construction of pJOE4580.2 



The plasmid pJOE4580.2 was formed from the published 
plasmid pJOE3075 (T. Stumpp, B. Wilms and 

J. Altenbuchner (2000) Biospektrum 1/2000: 33-36) by 
removing the malE gene by cutting with the restriction 
endonucleases Ndel/Hindlll and replacing it with two 
oligonucleotides which once again complemented the Ndel 
and Hindlll cleavage sites and, in addition to this, 
carried an Nhel, an Aatll and a PstI cleavage site. A 
Smal fragment from the plasmid pJOE773 

(J. Altenbuchner, p. viell, i. Pelletier (1992) 
Positive selection vectors based on palindromic DNA 
secjuences. Methods Enzymol 216: 457-466), which 
fragment carries the E.coli lacZalpha gene, was 
inserted into the Nhel cleavage site after filling 
using Klenow polymerase and dNTPs . When harboring this 
plasmid, E. coli JM109 gives blue colonies on LB plates 
containing X-Gal and IPTG. This plasmid was named 
PJOE4580.2. The FDH sequence (Seq. ID No. 7) was cloned 
into this plasmid. The resulting plasmid was named 
pAM3 .25. 



Fig. 2 - PAM5.22 
Constr-uction of pJOE4580.2 



The plasmid pJOE4580.2 was formed from the published 
plasmid PJOE3075 (T. Stumpp, b. Wilms and 

J. Altenbuchner (2000) Biospektr\im 1/2000: 33-36) by 
removing the malE gene by cutting with the restriction 
endonucleases Ndel/Hindlll and replacing it with two 
oligonucleotides which once again complemented the Ndel 
and Hindlll cleavage sites and, in addition to this. 
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carried an Nhel, an Aatll and a PstI cleavage site. A 
Smal fragment from the plasmid pJOE773 

{J. Altenbuchner, P. Viell, I. Pelletier (1992) Posi- 
tive selection vectors based on palindromic DNA sequen- 
ces. Methods Enzymol 216: 457-466), which fragment 
carries the E.coli lacZalpha gene, was inserted into 
the Nhel cleavage site after filling using Klenow 
polymerase and dNTPs . When harboring this plasmid, E. 
coli JM109 gives blue colonies on LB plates containing 
X-Gal and IPTG. This plasmid was named pJOE4580.2. The 
LeuDH sequence (Seq. ID No. 5) was inserted into this 
plasmid. The new plasmid is named pAM5.22. 

Fig. 3 - PAM8.21 



Construction of pHWG640.12 (Seq. ID No. 11) 

Plasmid PHWG640.12 has not previously been published 
and its construction is therefore described as follows. 
This plasmid pHWG640.12 is constructed proceeding from 
the published plasmid pAW229 in a manner which is 
readily reworkable. Plasmid pAW229 is a pACYC184 
derivative which contains a rhamnose promoter. 
Proceeding from pAW22 9 (B. Wilms, A. Wiese, C. Syldatk, 
R. Mattes, J. Altenbuchner (2001) J. Biotechnol 86: 19- 
30) , the hyuC gene was excised from the plasmid with 
Ndel/Hindlll and replaced with a PGR fragment which was 
cut with the same restriction enzymes and which 
contains the E. coli K12 sfcA (malic enzyme) gene. The 
resulting plasmid was designated pHWG640.12. The LeuDH 
sequence was inserted into this plasmid. The new 
plasmid is named pAM8.21. 

Fig. 4 - pAMlO.l (Seq. ID No. 10) 

The scfA gene (Seq. ID No. 11) was deleted from plasmid 
PAM8.21. The new plasmid is named pAMlO.l. 
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Fig. 5 

Biocatalyst with depiction of the course of the 
specific activity of leucine dehydrogenase (LeuDH) and 
formate dehydrogenase (FDH) , and of the optical 
density, in dependence on the induction time; for a 
detailed description of the fermentation conditions, 
see experimental section. 

Experimental examples 

Preparing the whole-cell catalyst 

Gene amplification and cloning 

In order to clone the formate dehydrogenase (FDH, fdh3 
from Candida hoidinii, mutant with lower sensitivity to 
oxidation) and leucine dehydrogenase {Bacillus cereus 
LeuDH) for the whole-cell catalysis of the conversion 
of trimethylpyruvate into tert-leucine with 
regeneration of cofactor, the genes for the two enzymes 
were first of all amplified by PGR from chromosomal DNA 
obtained from the abovementioned strains. The 
oligonucleotides employed are listed in Table 1 while 
the composition of the PGR mixtures is given in Table 2 
and the PGR program is given in Table 3 . 

Table 1: Oligonucleotides for amplifying the FDH and 
LeuDH genes 
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Oligo- 
nucleotide 


5 ' -3 ' seqijience 




Seq. ID 
No . 


S3713 


AAA AAA CTT AAG AAG GAG ATA TAG 
ATA TGA CAT TAG AAA TCT TOG AA 


LeuDH 
f oarward 


1 


S3714 


AAA AAA CTG GAG TTA GCG ACG GCT 
AAT AAT AT 


LeuDH 
reverse 


2 


S3723 


AAA AAA CAT ATG AAG ATT GTC TTA 
GTT CTT 


FDH 

f oiTvard 


3 


S3716 

.. 


AAA AAA GAC GTC TTA TTT CTT ATC 
GTG TTT ACC 


FDH 

reverse 


4 



The oligonucleotides were used to append cleavage sites 
for restriction endonucleases to the genes. These are 
5 Bfrl in the case of s3713, PstI in the case of s3714, 
Ndel in the case of s3723 and Aatll in the case of 
s3716 (see underlined regions) _ 

Table 2: PGR mixtures, polymerase, buffer and MgCl2 
10 originate from the company Biomaster; the plasmid DNA 
starting concentration was 5 0 |ag/ml 



Component 


For 
FDH 


Mixture 
for FDH 


for LeuDH 


Mixture 
for LeuDH 


Plasmid DNA 
from strain 
FDH-C235/C262A 




2 III 


pLeu2 

plasmid 

DNA 


2 ^il 


IQx buffer 




10 jLll 




10 |il 


5 0 mM MgC12 




3 Jll 




3 ^l 


100% DMSO 




10 |il 




10 |Lll 


10 mM dNTPs 




2 lal 




2 lal 


33 mM oligo 1 


S3723 


1 1^1 


S3713 


1 ^il 


33 mM oligo 2 


s3716 


1 111 


S3714 


1 m 


Taq polymerase 




1 ^il 




1 lal 


Deionized H20 




70 [ll 




70 |Lll 
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Table 3: PGR program: steps 2 to 4 were repeated 3 0 
times 



Step 


T, t for FDH 
amplification 


T, t for LeuDH 
amplification 


1 . Denaturation of 
the DNA 


940c, 5 min 


940c, 5 min 


2. Oligo annealing 


50°C, 1 min 


51°C, 1 min 


3 . DNA elongation 


72^0, 1: 30 min 


72^0, 1:30 min 


4 . Denaturation of 
the dsDNA 


92^0, 1 min 


92 oc, 1 min 


5. DNA elongation 


72 7 min 


72 oc, 7 min 



After the gene amplification, the PGR fragments were 
purified using the "DNA PGR and gel band purification 
kit" supplied by the company GFX and ligated into the 
L-rhamnose- inducible vectors pJOE4580.2 (pBR322 
derivative; Fig. 1) and, respectively, pHWG640.12 
(PACYC184 derivative; Fig. 3; Seq. ID No. 11) using the 
restriction endonucleases mentioned below. 

In general, restriction mixtures were prepared using 
approx. 50 |ig of DNA/ml in the 10 |al standard mixture. 
1 ^l of the first enzyme, and 1 ^il of the lOx 
concentrated enzyme buffer, were also added. The 
mixtures were adjusted to the final volume using 
deionized H20. The DNA to be inserted was incubated 
with the restriction enzymes separately from the 
plasmid DNA. After the restriction with the first 
enzyme, there then followed a precipitation step in 
which the DNA was precipitated with isopropanol and 
washed with ethanol and then dried and taken up in 8 |li1 
of TE 10.01. In each case I ]il of the second enzyme and 
1 |il of the second lOx enzyme buffer were added to 
these mixtures, which were incubated once again at 3 7 ^G 
for 1.5 h. The vector pAMlO.l was prepared from 
pAM8.21, this was also followed by a treatment with 



040055 AM 



- 21 - 



Klenow polymerase. The DNA was then separated into its 
fragments using a 1% agarose gel (Seakem agarose 
containing 0 . 4 |ig of ethidium bromide/ml) and the 
correct bands were excised with a scalpel for further 
use. The DNA was eluted, in accordance with the 
instructions, from the small gel blocks using the "EASY 
PURE gel purification kit" supplied by the company 
Biozym and taken up in 15 |il TE 10.01. 

For the ligation of vector and insert, the mixtures 
were selected such that the insert DNA was present at 
approximately twice the concentration of the target 
vector. In this case, too, the DNA concentration was 
approx. 50 |ag/ml . The final volume of the ligation 
mixtures was 10 |Lil, with the mixtures also containing 
1 ^1 of ligase and 1 ^il of lOx concentrated ligase 
buffer (both from ROCHE) in addition to the 
vector/insert mixture. The incubation took place 
overnight at 4^0. The ligation mixtures were 
transformed into E.coli K12 JM109, with this bacterium 
then being selected on LB agar containing antibiotics 
(100 ng of ampicillin/ml (pAM3.25 [Seq. ID No. 9], 
PAM5.22) or 25 lag chloramphenicol /ml (pAM8.21, pAMlO.l 
[Seq. ID No. 10]), and clones were checked for the 
expected plasmid after the plasmids had been isolated. 

Since LeuDH (Seq. ID No. 6) was initially to be coupled 
to malic enzyme (Seq. ID No. 12), the LeuDH gene was 
first of all inserted into pJOE4625.1, which already 
contained the gene for malic enzyme (sfcA) (Fig. 2) . 
The LeuDH gene was then inserted into pHGW540.12 
(Fig. 3), a pACYC184 derivative which also contained a 
rhamnose promoter and an sfcA gene, which latter was 
then deleted. The subcloning of the LeuDH gene from 
plasmid pAM5.22 (Fig. 2) into the target plasmid 
pAMlO.l (Fig. 4) was necessary in order to construct a 
two-plasmid system which requires two resistance 
markers for selection. 
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Table 4 : Cloning results 



Gene /vector 


Cloned, into 
plasmid 


Re s t r i c t i on 
with 


New 

designation 


Fig . 


FDH 

PGR fragment 


PJOE4580 .2 


Ndel, Aatll 


pAM3 . 2 5 


1 


LeuDH 

PGR fragment 


PJOE4625.1 


Bfrl, PstI 


pAM5 .22 


2 


LeuDH from 
PAM5.22 


PHWG640.12 


Bfrl, BamHI 


PAM8.21 


3 


pAM8 .21 


Without sfcA 
gene 


Muni, PstI 


pAMlO.l 


4 



Fermenting the whole-cell catalyst 

5 

After HPLC analysis had shown that the FDH/LeuDH 
combination {E.coli JM109 /pAM3 . 25/pAMlO . 1 ) achieved 
better results in converting t rime thy Ipyruvate into 
tert-leucine than a comparative model system (malic 

10 enzyme/LeuDH on pAM5.22) in miniature-scale (1 ml) 
experiments in a thermoshaker , plasmids pAM3,25 and 
pAMlO.l were transformed into E.coli BW3110 since this 
strain is more suitable for fermentations. The 
intention was to use high cell density fermentation to 

15 prepare a sufficiently large biomass for all the 
following investigations using the model system. The 
fermentation was carried out without any antibiotic, 
with the preliminary cultures having been grown in the 
presence of antibiotic, at BO^C in a 30 1 fermenter 

20 containing a final volume of 8 1 . For this, the cells 
were initially grown at 3 0°C as a batch culture up to 
an OD600 = 50 and until the glucose had been completely 
consumed (approx. 22h) . Gene expression was then 
induced by adding rharanose, which had been sterilized 

25 by filtration, to a final concentration of 0.2%, while 
fed batch culture was started by automatically adding 
nutrient solution and minerals (feed I and feed II) . 
Samples, whose OD and enzyme activities were 
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determined, using the respective activity tests in the 
latter case, were taken every two hours from the 
induction onward. The course of the OD, and of the 
activities, until fermentation was terminated are 
plotted against the time in Figure 5 . 

The fermentation was terminated 22h after the rhamnose 
induction since, despite increasing cell density, the 
activity of the FDH had stagnated and the cause of this 
was presumably plasmid loss or a reaction medium which 
was too acidic. The latter became apparent in the 
whole-cell reactions, in which the pH fell markedly 
(ApHmax = 0.8), as compared with a previously pH- 
regulated solution, when the moist biomass was added. 
The activities of the two enzymes reached 0.565 U/mg of 
total protein in the case of the LeuDH and 0.12 3 U/mg 
of total protein in the case of the FDH. The volume 
activities, based on the fermentation medium, were 
32.77 U/ml for the LeuDH and 7.14 U/ml for the FDH. 
After the medium had been removed in a separator, the 
cell yield was 1.4 kg of moist biomass. The cells were 
stored temporarily at -20oc until being used as whole- 
cell catalyst. 

Fermentation media 

Preliminary culture: 2 x 200 ml 

Preliminary culture medium: cNa2S04 x 10H2O = 2 g/1 

c(NH4)2S04 = 2.675 g/1 
CNH4C1 = 0.5 g/1 
CK2HP04 = 14.625 g/1 
cNaH2P04 x 2H20 =3.6 g/1 
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(stock solution in H20) 



autoclave in 90% by vol. 
H20 

cglucose = 10 g/1, final 
concentration 



autoclave separately 

IM MgS04 solution, 2 ml /I 
TES, 3 ml/1 

Thiamine stock solution (10 g/1 in H20) , 1 ml/1 

Batch culture: Add inoculum (380 ml in which Cx = 12 
g/1) containing glucose, MgS04, TES and thiamine in an 
inoculation flask to the autoclaved batch medium 



16 g 
21.4 g 
4 g 
117 g 
28.8 g 
8 g 

of H20 and sterilize in a 



Batch mediiom (c[uantity taken for 8 1): 
Na2S04 X 10H2O 
(NH4) 2S04 
NH4C1 
K2HP04 

NaH2P04 X 2H20 

(NH4) 2H-citrate 

dissolve in 7.5 1 

3 0 1 fermenter 

Glucose monohydrate 220 g 

dissolve in 5 00 ml of H20 and autoclave 
(25 g/1) 

IM MgS04 solution 16 ml 

24 ml 

Thiamine solution (10 g/1) 8 ml 

(sterilize the thiamine by filtration, 

autoclave the remainder) 

pH 7.2, using H3P04 and NH3 
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Fed batch feed: 

I. Glucose monohydrate 2750 g in 3 , 5 1 of H20 

autoclave 

MgS04 X 7H20 98 . 5 g in 0 . 15 1 of H20 

autoclave 

TES solution 0.5 1 

autoclave 

Thiamine 2 . 5 g in 0 . 5 1 of H20 

sterilize by filtration 
then combine in a feed flask 

II. (NH4)2HP04 396 g in 1 1 of H20, pH 7 

autoclave 

Feeds I and II are added using two separate pumps 
PH: 7.2 (titrated with H3P04 and NH3) 

PO2: approx. 50 kPa (regulated by the rotational 

speed of the agitator) 

Trace element 

solution (TES): CaC12 x 2H20 0.5 g 

ZnS04 X 7H20 0.18 g 

MnS04xH20 0.1 g 

Di-Na-EDTA 20.1 g 

FeC13 X 6H20 16.7 g 

CuS04 X 5H20 0.16 g 

C0CI2 X 6H20 0.18 g 
H20 to 1 1 
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Preparing L- tert-leucine using a whole-cell catalyst at 
900 mM without metering (comparative example 
synthesis example 1) 

50 ml of an 0.9 M solution of trimethylpyruvic acid (pH 
7-0, adjusted with 32% ammonia), which also contains 
1 mM magnesium chloride and 1% (v/v) toluene, are added 
to 5.85 g of the biocatalyst (E.coli JM105 (pAM 
3-25_10.1) biomass) and 7.95 g of ammonium formate 
(2.8 mol equivalents). The pH is adjusted to pH 7 . 0 at 
the beginning of the reaction and not regulated any 
further after that, resulting in the pH rising during 
the reaction. The reaction temperature is 3 0°C. After a 
reaction time of 8 h, a conversion of 24.6% is 
measured, with it not being possible to increase this 
conversion any further even after an additional 15 h of 
stirring. 

Preparing L-tert-leucine using a whole-cell catalyst at 
approx- 0.9 M and employing fed batch metering 
(synthesis example 2) 

23.84 g of ammonium formate (corresponding to 2.8 
equivalents based on the total substrate quantity 
employed) and 17.55 g of the biocatalyst (E.coli JM105 
(pAM 3.25_10.1) biomass) are initially weighed into a 
250 1 three-neck flask, after which 28.50 ml of 
deionized water and 150 ^1 of a IM solution of 
magnesium chloride (corresponding to a 1 mM 
concentration based on the final volume) are added. 
When the reaction temperature of 3 0^C has been reached, 
the reaction is started by adding 7.50 ml of a 1.8 M 
solution of trimethylpyruvic acid (pH 7.0, adjusted 
with 32% ammonia). The pH is then adjusted to 7.0 by 
adding 32% ammonia. After that, in each case 7.50 ml of 
a 1.8 M solution of trimethylpyruvic acid (pH 7.0, 
adjusted with 32% ammonia) are firstly metered in twice 
after which different volumes of a 1.8 M solution of 
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trimethylpyruvic acid (pH 7.0, adjusted with 32% 
ammonia) are metered in five times, with all the 
additions taking place at defined time intervals. The 
time intervals, and the quantities which are in each 
case metered in, are given in the following metering 
table- The final volume is 150 ml and the total 
concentration of added substrate is 0.86 M, 
corresponding to a volumetric quantity of trimethyl- 
pyruvic acid of 112.5 g/1. A complete conversion (> 98% 
in accordance with HPLC) is observed after a reaction 
time of 24 h. 



Metering table 


Substrate solution 


Substrate solution 


Time (h) 


ml (1.8 M) 


ml (0.9 M) 


0 


7.5 


0 


0.5 


7.5 


0 


1 


7.5 


0 


2.5 


0 


15 


4 


0 


17.5 


5.5 


0 


20 


6.5 


0 


22.5 


7 


0 


24 


Total volume of 
metered- in 
substrate solution 


22 .5 


99 



Preparing L-tert-leucine using a whole-cell catalyst at 
1 M and employing continuous metering (synthesis 
example 3) 

26.48 g of ammonium formate (corresponding to 2.8 
equivalents based on the total quantity of substrate 
employed), 150 |li1 of a 1 M solution of magnesium 
chloride (corresponding to a 1 mM concentration based 
on the final volume) and 19.49 g of the biocatalyst 
(E.coli JM105 (pAM3.25_10.1) biomass) are initially 
weighed into a 250 1 three-neck flask, after which 
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30 ml of deionized water are added. The pH is then 
adjusted to 7.0 by adding 32% ammonia. After the 
reaction temperature of 30°C has been reached, a total 
of 120 ml of a 1.25 M solution of trimethylpyruvic acid 
(pH 7.0, adjusted with 32% ammonia) are added 
continuously at a flow rate of 0.2 ml/min over a period 
of 10 hours. The final volume is 15 0 ml and the total 
concentration of substrate employed is 1.0 M, 
corresponding to a volumetric quantity of 
trimethylpyruvic acid of 130.1 g/1. A conversion of 96% 
(in accordance with HPLC) is observed after a reaction 
time of 27 h. 

Preparing L-tert-leucine using a whole-cell catalyst at 
7 00 mM and employing fed batch metering 

2.55 g of sodium formate (corresponds to 2.5 mol/1 
based on final volume) are initially added to a 
conically shaped 100 ml reaction flask belonging to a 
STAT Titrino 718, after which 15 ^1 of a 1 M solution 
of MgCl2 (corresponds to a final concentration of 1 mM) 
and 4.5 ml of a 1 M solution of TMP (pH 7, adjusted 
with 25% ammonia), and also 1.5% by vol. of toluene 
(based on the final volume) , are added. The volume is 
made up to 15 ml with deionized H2O . The reaction 
temperature of 30°C is kept stable, and controlled, by 
a closed-loop water circuit. 1 g of the biocatalyst 
moist biomass is resuspended in the substrate mixture 
and the pH is adjusted to 6 . 9 to 7 with 2 5% ammonia. 

After pH 7.5 has been reached, 4 . 5 ml of the 1 M TMP 
solution (pH 7) are added repeatedly. In this 
connection, the pH falls by approx. ApH = 0.3. As soon 
as pH 7.5 is reached, 4 . 5 ml of 1 M TMP solution are 
added once again. The addition of said volume of TMP is 
repeated lOx until the pH does not fall any further 
when TMP is added. In addition, 4 ml of a 4 M solution 
of sodium formate (corresponds, without taking any 
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reaction into consideration, to a concentration of 
973 mM in the medium) are added in connection with the 
eighth addition of TMP. The final volume is 64 ml, with 
a volumetric final concentration (without taking the 
reaction into consideration) of t rime thy Ipyruvic acid 
of 774 mM (100.6 g/1) . Sodium formate is present in 
solution at a final concentration of 836 mM. HPLC 
showed that 92% of the trime thy Ipyruvic acid had been 
converted after only 6h. 



The concentrations of the two substrates at the 
different addition points are listed in Table 5 below. 



Time 


Concentration 


Concentration 


Second 


[t in min] 


of trimethyl- 


of sodium 


addition of 




pyruvic acid 


formate 


sodium formate 




[mM] 


[mM] 




0 


300 


2500 




45 


461.54 


1923.08 




60 


562 .5 


1562.5 




75 


631.58 


1315.79 




90 


681.82 


1136.36 




105 


720 


1000 




120 


750 


892.86 




135 


774.19 


806.45 




150 


736.36 


972 . 73 


X 


180 


756.30 


899.16 




210 


773 .44 


835 .94 
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Patent claims: 

1. A process for preparing enantiomerically enriched 
L-a-amino acids or their salts by reacting the 
corresponding 2-ketocarboxylic acid with an 
ammonium ion donor in the presence of a whole- cell 
catalyst which comprises a cloned gene encoding a 
cofactor-dependent amino acid dehydrogenase and a 
cloned gene encoding an enzyme which regenerates 
the cofactor, at a total input of substrate per 
reaction voliime of > 500 mM, with the addition of 
the substrate being metered such that the 
stationary concentration of 2-ketocarboxylic acid 
is less than 500 mM and the external addition of 
cof actor, based on the total input of substrate, 
corresponds to < 0.0001 equivalents. 

2. The process as claimed in claim 1, 
characterized in that 

no cof actor is added to the reaction mixture. 

I. The process as claimed in claim 1 and/or 2, 
characterized in that 

use is made of 2-ketocarboxylic acids which yield 
amino acids of the general formula (I) 

R 

HgN^COOH 

(I) 

in which R is alkyl, in particular a space-filling 
branched alky 1 group which exhibits a tertiary C 
atom and possesses 5-10 C atoms, for example tert- 
butyl, and substituted alkyls. 
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The process as claimed in one or more of the 
preceding claims, 
characterized in that 

the substrate is metered in in accordance with a 
fed batch process. 

The process as claimed in one or more of the 
preceding claims, 
characterized in that 

the 2-ketocarboxylic acid is kept at a maximum 
stationary concentration of less than 450 mM, very 
preferably of less than 400 mM. 

The process as claimed in one or more of the 
preceding claims, 
characterized in that 

before it is used, the whole-cell catalyst is 
pretreated such that the permeability of the cell 
membrane for the substrate and products is 
increased as compared with the intact system. 
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Abstract : 

The present invention relates to a process for 
preparing, in particular, enantiomerically enriched 
5 L-a-amino acids, in particular those of the general 
formula (I) - 

R^COaH 
{I) 



10 In this connection, the process according to the 
invention uses 2-ketocarboxylic acids which are 
converted into the desired products using a whole-cell 
catalyst which comprises an amino acid dehydrogenase 
and a cof actor-regenerating enzyme. 



040055 AM 



Fig. 1 



1/5 




040055 AM 

Fig. 2 



2/5 




040055 AM 3/5 

Fig. 3 




040055 AM 

Fig . 4 




040055 MH 5/5 

Fig. 5 




0 S 10 15 20 25 

Induction time in h 



040055 AM 



- 1 - 



SEQUENCE LISTING 
<110> Degussa AG 

<120> Process for preparing optically active amino acids using a whole- 
cell catalyst 

<130> 040055 AM 

<160> 12 

<170> Patentin version 3.1 

<210> 1 

<211> 47 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 
<400> 1 

aaaaaactta agaaggagat atacatatga cattagaaat cttcgaa 47 

<210> 2 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 
<400> 2 

aaaaaactgc agttagcgac ggctaataat at 32 



<210> 3 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 3 

aaaaaacata tgaagattgt cttagttctt 

<210> 4 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<400> 4 

aaaaaagacg tcttatttct tatcgtgttt ac 



32 
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<210> 5 

<211> 1120 

5 <212> DNA 

<213> Bacillus cereus 

<220> 

<221> CDS 
10 <222> (20).. (1120) 
<223> 

<400> 5 

ttaagaagga gatatacat atg aca tta gaa ate ttc gaa tac tta gaa aaa 52 
15 Met Thr Leu Glu lie Phe Glu Tyr Leu Glu Lys 

15 10 

tat gat tat gag caa gta gta ttt tgt caa gat aaa gaa tct ggt tta 100 
Tyr Asp Tyr Glu Gin Val Val Phe Cys Gin Asp Lys Glu Ser Gly Leu 
20 15 20 25 

aaa gca att att gca att cat gat aca aca ctt gga ccg get ctt ggt 148 
Lys Ala lie He Ala He His Asp Thr Thr Leu Gly Pro Ala Leu Gly 
30 35 40 



25 



gga aca aga atg tgg aca tat gat tct gaa gaa gcg gcg att gaa gat 196 
Gly Thr Arg Met Trp Thr Tyr Asp Ser Glu Glu Ala Ala He Glu Asp 
45 50 55 



3 0 gca ttg cgt ctt gca aaa ggg atg aca tac aaa aac gca gca get ggt 
Ala Leu Arg Leu Ala Lys Gly Met Thr Tyr Lys Asn Ala Ala Ala Gly 
60 65 70 75 



244 



tta aac tta ggt ggt gcg aaa aca gta att ate ggt gat cct cgt aaa 292 
35 Leu Asn Leu Gly Gly Ala Lys Thr Val He He Gly Asp Pro Arg Lys 

80 85 90 

gat aag age gaa gca atg ttc cgt gca eta gga cgt tat ate caa gga 340 
Asp Lys Ser Glu Ala Met Phe Arg Ala Leu Gly Arg Tyr He Gin Gly 
^0 95 100 105 

eta aac gga cgt tac att aca get gaa gat gtt ggt aca aca gta gat 388 
Leu Asn Gly Arg Tyr He Thr Ala Glu Asp Val Gly Thr Thr Val Asp 
110 115 120 

45 

gat atg gat att ate cat gaa gaa act gae ttt gta aca ggt ate tea 436 
Asp Met Asp He He His Glu Glu Thr Asp Phe Val Thr Gly He Ser 
125 130 135 

50 cca tea ttc ggt tct tct ggt aac eca tct ccg gta act gca tac ggt 484 
Pro Ser Phe Gly Ser Ser Gly Asn Pro Ser Pro Val Thr Ala Tyr Gly 
140 145 150 155 

gtt tac cgt ggt atg aaa gca get gca aaa gaa get ttc ggt act gae 532 
55 Val Tyr Arg Gly Met Lys Ala Ala Ala Lys Glu Ala Phe Gly Thr Asp 

160 165 170 



aat tta gaa gga aaa gta att get gtt caa gge gtt ggt aac gta gca 
Asn Leu Glu Gly Lys Val He Ala Val Gin Gly Val Gly Asn Val Ala 



580 
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10 



175 180 185 

tat cac eta tgc aaa cat tta cac get gaa gga gca aaa tta att gtt 628 
Tyr His Leu Cys Lys His Leu His Ala Glu Gly Ala Lys Leu lie Val 
190 195 200 

aca gat att aat aaa gaa get gta caa egt get gta gaa gaa ttc ggt 676 
Thr Asp lie Asn Lys Glu Ala Val Gin Arg Ala Val Glu Glu Phe Gly 
205 210 215 

gca tea gca gtt gaa eca aat gaa att tac ggt gtt gaa tgc gat att 724 
Ala Ser Ala Val Glu Pro Asn Glu lie Tyr Gly Val Glu Cys Asp lie 
220 225 230 235 

15 tac gca eca tgt gca eta ggc gca aca gtt aat gat gaa act att eca 772 
Tyr Ala Pro Cys Ala Leu Gly Ala Thr Val Asn Asp Glu Thr lie Pro 

240 245 250 

caa ett aaa gca aaa gta ate gca ggt tct gcg aat aac caa tta aaa 820 
2 0 Gin Leu Lys Ala Lys Val lie Ala Gly Ser Ala Asn Asn Gin Leu Lys 
255 260 265 

gaa gat egt cat ggt gae ate att cat gaa atg ggt att gta tac gca 868 
Glu Asp Arg His Gly Asp lie lie His Glu Met Gly lie Val Tyr Ala 
25 270 275 280 

eca gat tat gta att aat gca ggt ggc gta att aac gta gca gae gaa 916 

Pro Asp Tyr Val lie Asn Ala Gly Gly Val lie Asn Val Ala Asp Glu 
285 290 295 

30 

tta tat gga tac aat aga gaa egt gca eta aaa egt gtt gag tct att 964 

Leu Tyr Gly Tyr Asn Arg Glu Arg Ala Leu Lys Arg Val Glu Ser lie 
300 305 310 315 

35 tat gae acg att gca aaa gta ate gaa att tea aaa cgc gat ggc ata 1012 
Tyr Asp Thr lie Ala Lys Val lie Glu lie Ser Lys Arg Asp Gly lie 
320 325 330 

gca act tat gta gcg gca gat egt eta get gaa gag cgc att gca age 1060 
40 Ala Thr Tyr Val Ala Ala Asp Arg Leu Ala Glu Glu Arg lie Ala Ser 
335 340 345 

ttg aag aat tct egt age act tac tta cgc aac ggt cac gat att att 1108 
Leu Lys Asn Ser Arg Ser Thr Tyr Leu Arg Asn Gly His Asp lie lie 
45 350 355 360 

age egt cgc taa 1120 
Ser Arg Arg 
365 

50 

<210> 6 
<211> 366 
<212> PRT 
55 <213> Bacillus cereus 

<400> 6 



Met Thr Leu Glu lie Phe Glu Tyr Leu Glu Lys Tyr Asp Tyr Glu Gin 
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15 10 15 

Val val Phe Cys Gin Asp Lys Glu Ser Gly Leu Lys Ala He He Ala 
5 20 25 30 

He His Asp Thr Thr Leu Gly Pro Ala Leu Gly Gly Thr Arg Met Trp 
35 40 45 

0 

Thr Tyr Asp Ser Glu Glu Ala Ala He Glu Asp Ala Leu Arg Leu Ala 
50 55 60 

5 

Lys Gly Met Thr Tyr Lys Asn Ala Ala Ala Gly Leu Asn Leu Gly Glv 
^5 70 75 80 

0 Ala Lys Thr Val He He Gly Asp Pro Arg Lys Asp Lys Ser Glu Ala 

85 90 95 

Met Phe Arg Ala Leu Gly Arg Tyr He Gin Gly Leu Asn Gly Arg Tyr 
100 105 110 

He Thr Ala Glu Asp Val Gly Thr Thr Val Asp Asp Met Asp He He 
115 120 125 

His Glu Glu Thr Asp Phe Val Thr Gly He Ser Pro Ser Phe Gly Ser 
130 135 140 

Ser Gly Asn Pro Ser Pro Val Thr Ala Tyr Gly Val Tyr Arg Gly 



145 150 



Met 

155 160 



Lys Ala Ala Ala Lys Glu Ala Phe Gly Thr Asp Asn Leu Glu Gly Lys 
165 170 175 

Val He Ala Val Gin Gly Val Gly Asn Val Ala Tyr His Leu Cys Lys 
180 185 190 

His Leu His Ala Glu Gly Ala Lys Leu He Val Thr Asp He Asn Lys 
195 200 205 

Glu Ala val Gin Arg Ala Val Glu Glu Phe Gly Ala Ser Ala Val Glu 
210 215 220 

Pro Asn Glu He oyr Gly Val Glu Cys Asp He ^r Ala Pro Cys Ala 

230 235 240 
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Leu Gly Ala Thr Val Asn Asp Glu Thr He Pro Gin Leu Lys Ala Lys 
245 250 255 

5 val He Ala Gly Ser Ala Asn Asn Gin Leu Lys Glu Asp Arg His Gly 
260 265 270 

Asp He He His Glu Met Gly He Val Tyr Ala Pro Asp Tyr Val He 
10 275 280 285 



15 



20 



Asn Ala Gly Gly Val He Asn Val Ala Asp Glu Leu Tyr Gly Tyr Asn 
290 295 300 

Arg Glu Arg Ala Leu Lys Arg Val Glu Ser He Tyr Asp Thr He Ala 

310 315 320 

Lys Val He Glu He Ser Lys Arg Asp Gly He Ala Thr Tyr Val Ala 
325 330 



25 Ala ASP Arg Leu Ala Glu Glu Arg He Ala Ser Leu Lys Asn Ser Arg 
340 345 350 

Ser Thr Tyr Leu Arg Asn Gly His Asp He He Ser Arg Arg 
■3U 355 3gQ 



365 



<210> 7 

<211> 1095 

35 <212> DNA 

<213> Candida boidinii 

<220> 

<221> CDS 

40 <222> (1)..{1095) 

<223> 

<400> 7 



d-^ Mo? ^""^ ""^^ Srct ggt aag cac get get gat 48 

45 Met Lys He Val Leu Val Leu Tyr Asp Ala Gly Lys His Ala Ala Lp 
^5 10 15 



50 



55 



Clu ftt T T ?r ^""^ ^""^ att get aat 

Glu Glu Lys Leu Tyr Gly Ser Thr Glu Asn Lys Leu Gly He Ala Asn 
20 25 30 

tgg tta aaa gat caa ggt cat gaa eta att act act tct gat aaa gaa 

Trp Leu Lys Asp Gin Gly His Glu Leu He Thr Thr Ser Asp Lys Slu 

40 45 

t^"" gat att ate 

Gly Glu Thr Ser Glu Leu Asp Lys His He Pro Asp Ala Asp He lie 

55 60 



96 



144 



192 
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ate acc act cct ttc cat cct get tat ate act aag gaa aga ett gac 240 
lie Thr Thr Pro Phe His Pro Ala Tyr lie Thr Lys Glu Arg Leu Asp 
65 70 75 80 

5 aag get aag aac tta aaa tta gte gtt gtc get ggt gtt ggt tct gat 288 
Lys Ala Lys Asn Leu Lys Leu Val Val Val Ala Gly Val Gly Ser Asp 
85 90 95 

cac att gat tta gat tat att aat caa aea ggt aag aaa ate tea gte 336 
10 His lie Asp Leu Asp Tyr lie Asn Gin Thr Gly Lys Lys He Ser Val 
100 105 110 



etg gaa gtt aea ggt tet aat gtt gtc tct gtt get gaa cac gtt gtc 
Leu Glu Val Thr Gly Ser Asn Val Val Ser Val Ala Glu His Val Val 
15 115 120 



20 



25 



30 



40 



45 



50 



125 



gat ate gaa ggt aaa act ate get acc att ggt get ggt aga att ggt 
Asp He Glu Gly Lys Thr He Ala Thr He Gly Ala Gly Arg He Gly 
165 170 175 

tac aga gtc ttg gaa aga tta etc cea ttt aat eca aaa gaa tta tta 
Tyr Arg Val Leu Glu Arg Leu Leu Pro Phe Asn Pro Lys Glu Leu Leu 
180 185 190 

tac tac gat tat caa get tta cea aaa gaa get gaa gaa aaa gtt ggt 
Tyr Tyr Asp Tyr Gin Ala Leu Pro Lys Glu Ala Glu Glu Lys Val Gly 



35 195 



200 205 



get aga aga gtt gaa aat att gaa gaa tta gtt get caa get gat ate 
Ala Arg Arg Val Glu Asn He Glu Glu Leu Val Ala Gin Ala Asp He 
210 215 220 

gtt aea gtt aat get eca tta cac gea ggt aea aaa ggt tta att aat 
Val Thr Val Asn Ala Pro Leu His Ala Gly Thr Lys Gly Leu He Asn 

230 235 240 

aag gaa tta tta tct aaa ttt aaa aaa ggt get tgg tta gte aat acc 
Lys Glu Leu Leu Ser Lys Phe Lys Lys Gly Ala Trp Leu Val Asn Thr 
245 250 255 

gea aga ggt get att get gtt get gaa gat gtt gea gea get tta gaa 
Ala Arg Gly Ala He Ala Val Ala Glu Asp Val Ala Ala Ala Leu Glu 
260 265 270 

tet ggt caa tta aga ggt tac ggt ggt gat gtt tgg ttc cea caa cea 
Ser Gly Gin Leu Arg Gly Tyr Gly Gly Asp Val Trp Phe Pro Gin Pro 



55 275 



280 285 



get cea aag gat cac eca tgg aga gat atg aga aat aaa tat ggt get 
Ala Pro Lys Asp His Pro Trp Arg Asp Met Arg Asn Lys Tyr Gly Ala 
290 295 300 



384 



atg acc atg ett gte ttg gtt aga aat ttc gtt cea gea eat gaa caa 432 
Met Thr Met Leu Val Leu Val Arg Asn Phe Val Pro Ala His Glu Gin 
130 135 140 

att att aac cac gat tgg gag gtt get get ate get aag gat get tac 480 
He He Asn His Asp Trp Glu Val Ala Ala He Ala Lys Asp Ala Tyr 
145 150 155 160 



528 



576 



624 



672 



720 



768 



816 



864 



912 
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ggt aat gcc atg act cct cac tac tct ggt act act tta gac get caa 
Gly Asn Ala Met Thr Pro His Tyr Ser Gly Thr Thr Leu Asp Ala Gin 
305 310 315 320 

aca aga tac get gaa ggt act aaa aat att ttg gaa tea ttc ttt acc 
Thr Arg Tyr Ala Glu Gly Thr Lys Asn lie Leu Glu Ser Phe Phe Thr 
325 330 335 

ggt aaa ttt gat tac aga cca caa gat att ate tta tta aat ggt gaa 
Gly Lys Phe Asp Tyr Arg Pro Gin Asp He He Leu Leu Asn Gly Glu 
340 345 350 

tac gtt act aaa get tac ggt aaa cac gat aag aaa taa 
Tyr Val Thr Lys Ala Tyr Gly Lys His Asp Lys Lys 
355 360 



<210> 8 

<211> 364 

<212> PRT 

<213> Candida boidinii 

<400> 8 

Met Lys He Val Leu Val Leu Tyr Asp Ala Gly Lys His Ala Ala Asp 
1 5 10 15 

Glu Glu Lys Leu Tyr Gly Ser Thr Glu Asn Lys Leu Gly He Ala Asn 
20 25 30 

Trp Leu Lys Asp Gin Gly His Glu Leu He Thr Thr Ser Asp Lys Glu 
35 40 45 

Gly Glu Thr Ser Glu Leu Asp Lys His He Pro Asp Ala Asp He He 
50 55 60 

He Thr Thr Pro Phe His Pro Ala Tyr He Thr Lys Glu Arg Leu Asp 
^5 70 75 80 

Lys Ala Lys Asn Leu Lys Leu Val Val Val Ala Gly Val Gly Ser Asp 
85 90 95 

His He Asp Leu Asp Tyr He Asn Gin Thr Gly Lys Lys He Ser Val 
100 105 110 

Leu Glu Val Thr Gly Ser Asn Val Val Ser Val Ala Glu His Val Val 
115 120 125 

Met Thr Met Leu Val Leu Val Arg Asn Phe Val Pro Ala His Glu Gin 
=^30 135 140 
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He He Asn His Asp Trp Glu Val Ala Ala He Ala Lys Asp Ala Tyr 

150 155 160 

Asp He Glu Gly Lys Thr He Ala Thr He Gly Ala Gly Arg He Gly 
165 170 175 

Tyr Arg Val Leu Glu Arg Leu Leu Pro Phe Asn Pro Lys Glu Leu Leu 
180 185 190 

Tyr Tyr Asp Tyr Gin Ala Leu Pro Lys Glu Ala Glu Glu Lys Val Gly 
195 200 205 

Ala Arg Arg Val Glu Asn He Glu Glu Leu Val Ala Gin Ala Asp He 
210 215 220 

val Thr val Asn Ala Pro Leu His Ala Gly Thr Lys Gly Leu He Asn 

230 235 240 

Lys Glu Leu Leu Ser Lys Phe Lys Lys Gly Ala Trp Leu Val Asn Thr 
245 250 255 

Ala Arg Gly Ala He Ala Val Ala Glu Asp Val Ala Ala Ala Leu Glu 
260 265 270 

Ser Gly Gin Leu Arg Gly TVr Gly Gly Asp Val Trp Phe Pro Gin Pro 
275 280 285 

oon ^""^ -^^9 Tyr Gly Ala 

290 295 300 

Gly Asn Ala Met Thr Pro His Tyr Ser Gly Thr Thr Leu Asp Ala Gin 
^ 310 315 

Thr Arg Tyr Ala Glu Gly Thr Lys Asn He Leu Glu Ser Phe Phe Thr 
325 330 

Gly Lys Phe Asp Tyr Arg Pro Gin Asp He He Leu Leu Asn Gly Glu 
340 345 35Q 

Tyr Val Thr Lys Ala Tyr Gly Lys His Asp Lys Lys 
355 360 
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<211> 5686 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid pAM3.25 

<400> 9 

tatgaagatt gtcttagttc tttatgatgc tggtaagcac gctgctgatg aagaaaaatt 60 

atatggttct actgaaaata aattaggtat tgctaattgg ttaaaagatc aaggtcatga 120 

actaattact acttctgata aagaaggtga aacaagtgaa ttggataaac atatcccaga 180 

tgctgatatt atcatcacca ctcctttcca tcctgcttat atcactaagg aaagacttga 240 

caaggctaag aacttaaaat tagtcgttgt cgctggtgtt ggttctgatc acattgattt 300 

agattatatt aatcaaacag gtaagaaaat ctcagtcctg gaagttacag gttctaatgt 360 

tgtctctgtt gctgaacacg ttgtcatgac catgcttgtc ttggttagaa atttcgttcc 420 

agcacatgaa caaattatta accacgattg ggaggttgct gctatcgcta aggatgctta 480 

cgatatcgaa ggtaaaacta tcgctaccat tggtgctggt agaattggtt acagagtctt 540 

ggaaagatta ctcccattta atccaaaaga attattatac tacgattatc aagctttacc 600 

aaaagaagct gaagaaaaag ttggtgctag aagagttgaa aatattgaag aattagttgc 660 

tcaagctgat atcgttacag ttaatgctcc attacacgca ggtacaaaag gtttaattaa 720 

taaggaatta ttatctaaat ttaaaaaagg tgcttggtta gtcaataccg caagaggtgc 780 

tattgctgtt gctgaagatg ttgcagcagc tttagaatct ggtcaattaa gaggttacgg 840 

tggtgatgtt tggttcccac aaccagctcc aaaggatcac ccatggagag atatgagaaa 900 

taaatatggt gctggtaatg ccatgactcc tcactactct ggtactactt tagacgctca 960 

aacaagatac gctgaaggta ctaaaaatat tttggaatca ttctttaccg gtaaatttga 1020 

ttacagacca caagatatta tcttattaaa tggtgaatac gttactaaag cttacggtaa 1080 

acacgataag aaataagacg tcaagcttgg ctgttttggc ggatgagaga agattttcag 1140 

cctgatacag attaaatcag aacgcagaag cggtctgata aaacagaatt tgcctggcgg 12 00 

cagtagcgcg gtggtcccac ctgaccccat gccgaactca gaagtgaaac gccgtagcgc 1260 

cgatggtagt gtggggtctc cccatgcgag agtagggaac tgccaggcat caaataaaac 1320 

gaaaggctca gtcgaaagac tgggcctttc gttttatctg ttgtttgtcg gtgaacgctc 1380 

tcctgagtag gacaaatccg ccgggagcgg atttgaacgt tgcgaagcaa cggcccggag 1440 

ggtggcgggc aggacgcccg ccataaactg ccaggcatca aattaagcag aaggccatcc 1500 

tgacggatgg cctttttgcg tttctacaaa ctcttttgtt tatttttcta aatacattca 1560 
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10 



15 



20 



50 



55 



aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg 1620 

aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc 1680 

cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg 1740 

ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt 1800 

cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta I8 6 0 

ttatcccgtg ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat 1920 

gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga 1980 
gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca 



2040 

acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact 2100 

cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc 2160 

acgatgcctg tagcaatggc aacaacgttg cgcaaactat taactggcga actacttact 2220 

ctagcttccc ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt 2280 

ctgcgctcgg cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt 2340 

gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt 2400 

30 atctacacga cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata 2460 

ggtgcctcac tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag 2520 

attgatttaa aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat 2580 

ctcatgacca aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa 2640 

aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca 2700 

40 aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt 2760 

ccgaaggtaa ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg 2 82 0 

tagttaggcc accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc 2880 

ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga 2940 

cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc 3 000 

agcttggagc gaacgaccta caccgaactg agatacctac agcgtgagct atgagaaagc 3060 

gccacgcttc ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca 3120 

ggagagcgca cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg 3180 

tttcgccacc tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta 3240 

tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct 3300 
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cacatgttct ttcctgcgtt atcccctgat tctgtggata accgtattac cgcctttgag 3360 

tgagctgata ccgctcgccg cagccgaacg accgagcgca gcgagtcagt gagcgaggaa 3420 

gcggaagagc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc 3480 

atatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc cagtatacac 3540 

tccgctatcg ctacgtgact gggtcatggc tgcgccccga cacccgccaa cacccgctga 3600 

cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc 3660 

cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga ggcagctgcg 3720 

gtaaagctca tcagcgtggt cgtgaagcga ttcacagatg tctgcctgtt catccgcgtc 3780 

cagctcgttg agtttctcca gaagcgttaa tgtctggctt ctgataaagc gggccatgtt 3840 

aagggcggtt ttttcctgtt tggtcacttg atgcctccgt gtaaggggga atttctgttc 3900 

atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat 3960 

gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg 402 0 
gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt 



4080 



ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct 4140 

gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct 4200 

caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca 4260 

ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg 4320 

atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg 4380 

gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc 444 0 

cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg 4500 

cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga 4560 

caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg 4620 

cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga 4680 

gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca 4740 

acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc 4800 

ctcgcgtcgc gaacgccagc aagacgtagc ccagcgcgtc ggccgccatg ccggcgataa 4860 

tggcctgctt ctcgccgaaa cgtttggtgg cgggaccagt gacgaaggct tgagcgaggg 4920 

cgtgcaagat tccgaatacc gcaagcgaca ggccgatcat cgtcgcgctc cagcgaaagc 4980 

ggtcctcgcc gaaaatgacc cagagcgctg ccggcacctg tcctacgagt tgcatgataa 5040 

agaagacagt cataagtgcg gcgacgatag tcatgccccg cgcccaccgg aaggagctga 5100 
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10 



ctgggttgaa ggctctcaag ggcatcggtc gacgctctcc cttatgcgac tcctgcatta 5160 

ggaagcagcc cagtagtagg ttgaggccgt tgagcaccgc cgccgcaagg aatggtgcat 5220 

gctcgatggc tacgagggca gacagtaagt ggatttacca taatccctta attgtacgca 5280 

ccgctaaaac gcgttcagcg cgatcacggc agcagacagg taaaaatggc aacaaaccac 5340 

cctaaaaact gcgcgatcgc gcctgataaa ttttaaccgt atgaatacct atgcaaccag 5400 

agggtacagg ccacattacc cccacttaat ccactgaagc tgccattttt catggtttca 5460 

ccatcccagc gaagggccat gcatgcatcg aaattaatac gacgaaatta atacgactca 5520 
ctatagggca attgcgatca ccacaattca gcaaattgtg aacatcatca cgttcatctt 
tccctggttg ccaatggccc attttcctgt cagtaacgag aaggtcgcga attcaggcgc 
tttttagact ggtcgtaatg aacaattctt aagaaggaga tataca 



20 



<210> 10 

<211> 5106 

25 <212> DNA 

<213> Artificial 



30 



<220> 

<223> Plasmid pAMlO.l 



40 



<400> 10 

gaaggagata tacatatgac attagaaatc ttcgaatact tagaaaaata tgattatgag 
caagtagtat tttgtcaaga taaagaatct ggtttaaaag caattattgc aattcatgat 
acaacacttg gaccggctct tggtggaaca agaatgtgga catatgattc tgaagaagcg 
gcgattgaag atgcattgcg tcttgcaaaa gggatgacat acaaaaacgc agcagctggt 
ttaaacttag gtggtgcgaa aacagtaatt atcggtgatc ctcgtaaaga taagagcgaa 
gcaatgttcc gtgcactagg acgttatatc caaggactaa acggacgtta cattacagct 
gaagatgttg gtacaacagt agatgatatg gatattatcc atgaagaaac tgactttgta 
acaggtatct caccatcatt cggttcttct ggtaacccat ctccggtaac tgcatacggt 
gtttaccgtg gtatgaaagc agctgcaaaa gaagctttcg gtactgacaa tttagaagga 
aaagtaattg ctgttcaagg cgttggtaac gtagcatatc acctatgcaa acatttacac 
gctgaaggag caaaattaat tgttacagat attaataaag aagctgtaca acgtgctgta 
gaagaattcg gtgcatcagc agttgaacca aatgaaattt acggtgttga atgcgatatt 
tacgcaccat gtgcactagg cgcaacagtt aatgatgaaa ctattccaca acttaaagca 
aaagtaatcg caggttctgc gaataaccaa ttaaaagaag atcgtcatgg tgacatcatt 



50 



5580 
5640 
5686 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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900 
960 



catgaaatgg gtattgtata cgcaccagat tatgtaatta atgcaggtgg cgtaattaac 
gtagcagacg aattatatgg atacaataga gaacgtgcac taaaacgtgt tgagtctatt 

tatgacacga ttgcaaaagt aatcgaaatt tcaaaacgcg atggcatagc aacttatgta 1020 

gcggcagatc gtctagctga agagcgcatt gcaagcttga agaattctcg tagcacttac 1080 

ttacgcaacg gtcacgatat tattagccgt cgctaacgcg tttgcggttg gcaaaatggc 1140 

gcagcagcaa ggcgtggcgg tgaaaacctc tgccgaagcc ctgcaacagg ccattgacga 1200 

taatttctgg caagccgaat accgcgacta ccgccgtacc tccatctaaa agcttatcga 1260 

tgataagctg tcaaacatga gaattacaac ttatatcgta tggggctgac ttcaggtgct 1320 

acatttgaag agataaattg cactgaaatc tagaaatatt ttatctgatt aataagatga 1380 

tcttcttgag atcgttttgg tctgcgcgta atctcttgct ctgaaaacga aaaaaccgcc 1440 

ttgcagggcg gtttttcgaa ggttctctga gctaccaact ctttgaaccg aggtaactgg 1500 

cttggaggag cgcagtcacc aaaacttgtc ctttcagttt agccttaacc ggcgcatgac 1560 

ttcaagacta actcctctaa atcaattacc agtggctgct gccagtggtg cttttgcatg 1620 

tctttccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cggactgaac 1680 
ggggggttcg tgcatacagt ccagcttgga gcgaactgcc tacccggaac tgagtgtcag 
gcgtggaatg agacaaacgc ggccataaca gcggaatgac accggtaaac cgaaaggcag 
gaacaggaga gcgcacgagg gagccgccag gggaaacgcc tggtatcttt atagtcctgt 

cgggtttcgc caccactgat ttgagcgtca gatttcgtga tgcttgtcag gggggcggag 1920 

cctatggaaa aacggctttg ccgcggccct ctcacttccc tgttaagtat cttcctggca 1980 

tcttccagga aatctccgcc ccgttcgtaa gccatttccg ctcgccgcag tcgaacgacc 2040 

gagcgtagcg agtcagtgag cgaggaagcg gaatatatcc tgtatcacat attctgctga 2100 

cgcaccggtg cagccttttt tctcctgcca catgaagcac ttcactgaca ccctcatcag 2160 

tgccaacata gtaagccagt atacactccg ctagcgctga tgtccggcgg tgcttttgcc 2220 
gttacgcacc accccgtcag tagctgaaca ggagggacag ctgatagaaa cagaagccac 
tggagcacct caaaaacacc atcatacact aaatcagtaa gttggcagca tcacccgacg 
cactttgcgc cgaataaata cctgtgacgg aagatcactt cgcagaataa ataaatcctg 

gtgtccctgt tgataccggg aagccctggg ccaacttttg gcgaaaatga gacgttgatc 2460 

ggcacgtaag aggttccaac tttcaccata atgaaataag atcactaccg ggcgtatttt 2520 

ttgagttatc gagattttca ggagctaagg aagctaaaat ggagaaaaaa atcactggat 2580 

ataccaccgt tgatatatcc caatggcatc gtaaagaaca ttttgaggca tttcagtcag 2640 



1740 
1800 
1860 



2280 
2340 
2400 
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30 



ttgctcaatg tacctataac cagaccgttc agctggatat tacggccttt ttaaagaccg 2700 
^ taaagaaaaa taagcacaag ttttatccgg cctttattca cattcttgcc cgcctgatga 2750 
atgctcatcc ggaattccgt atggcaatga aagacggtga gctggtgata tgggatagtg 2820 
ttcacccttg ttacaccgtt ttccatgagc aaactgaaac gttttcatcg ctctggagtg 288 0 
10 aataccacga cgatttccgg cagtttctac acatatattc gcaagatgtg gcgtgttacg 2940 
gtgaaaacct ggcctatttc cctaaagggt ttattgagaa tatgtttttc gtctcagcca 3000 
atccctgggt gagtttcacc agttttgatt taaacgtggc caatatggac aacttcttcg 3060 
cccccgtttt caccatgggc aaatattata cgcaaggcga caaggtgctg atgccgctgg 312 0 
cgattcaggt tcatcatgcc gtctgtgatg gcttccatgt cggcagaatg cttaatgaat 3180 
20 tacaacagta ctgcgatgag tggcagggcg gggcgtaatt tttttaaggc agttattggt 
gcccttaaac gcctggtgct acgcctgaat aagtgataat aagcggatga atggcagaaa 
ttcgaaagca aattcgaccc ggtcgtcggt tcagggcagg gtcgttaaat agccgcttat 
gtctattgct ggtttaccgg tttattgact accggaagca gtgtgaccgt gtgcttctca 
aatgcctgag gccagtttgc tcaggctctc cccgtggagg taataattga cgatatgatc 
atttattctg cctcccagag cctgataaaa acggttagcg cttcgttaat acagatgtag 
gtgttccaca gggtagccag cagcatcctg cgatgcagat ccggaacata atggtgcagg 
gcgcttgttt cggcgtgggt atggtggcag gccccgtggc cgggggactg ttgggcgctg 
ccggcacctg tcctacgagt tgcatgataa agaagacagt cataagtgcg gcgacgatag 3720 
tcatgccccg cgcccaccgg aaggagctac cggacagcgg tgcggactgt tgtaactcag 
aataagaaat gaggccgctc atggcgttga ctctcagtca tagtatcgtg gtatcaccgg 
ttggttccac tctctgttgc gggcaacttc agcagcacgt aggggacttc cgcgtttcca 
gactttacga aacacggaaa ccgaagacca ttcatgttgt tgctcaggtc gcagacgttt 
tgcagcagca gtcgcttcac gttcgctcgc gtatcggtga ttcattctgc taaccagtaa 
ggcaaccccg ccagcctagc cgggtcctca acgacaggag cacgatcatg cgcacccgtg 
50 gccaggaccc aacgctgccc gagatgcgcc gcgtgcggct gctggagatg gcggacgcga 4140 
tggatatgtt ctgccaaggg ttggtttgcg cattcacagt tctccgcaag aattgattgg 4200 
ctccaattct tggagtggtg aatccgttag cgaggtgccg ccggcttcca ttcaggtcga 4260 
ggtggcccgg ctccatgcac cgcgacgcaa cgcggggagg cagacaaggt atagggcggc 4320 
gcctacaatc catgccaacc cgttccatgt gctcgccgag gcggcataaa tcgccgtgac 4380 



40 



3240 

3300 

3360 

3420 

3480 

3540 

3500 

3 660 



3780 
3840 
3900 
3950 
4020 
4080 
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gatcagcggt ccagtgatcg aagttaggct ggtaagagcc gcgagcgatc cttgaagctg 4440 
tccctgatgg tcgtcatcta cctgcctgga cagcatggcc tgcaacgcgg gcatcccgat 4500 

5 gccgccggaa gcgagaagaa tcataatggg gaaggccatc cagcctcgcg tcgcgaacgc 4560 

cagcaagacg tagcccagcg cgtcggccgc catgccggcg ataatggcct gcttctcgcc 4620 

gaaacgtttg gtggcgggac cagtgacgaa ggcttgagcg agggcgtgca agattccgaa 4680 

taccgcaagc gacaggccga tcatcgtcgc gctccagcga aagcggtcct cgccgaaaat 4740 

gacccagagc gctgccggca cctgtcctac gagttgcatg ataaagaaga cagtcataag 4800 

15 tgcggcgacg atagtcatgc cccgcgccca ccggaaggag ctgactgggt tgaaggctct 4860 

caagggcatc ggtcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag 4920 

taggttgagg ccgttgagca ccgccgccgc aaggaatggt gcatgcatcg atcaccacaa 4980 

ttcagcaaat tgtgaacatc atcacgttca tctttccctg gttgccaatg gcccattttc 5040 

ctgtcagtaa cgagaaggtc gcgaattcag gcgcttttta gactggtcgt aatgaacaat 5100 



25 tcttaa 



<210> 11 

<211> 5597 

30 <212> DNA 

<213> Unknown 



35 



40 



55 



<220> 

<223> Plasmid 
<220> 

<221> CDS 

<222> (25).. (1749) 

<223> scfA - malic enzyme gene 



5 

p^" f^.t f.".^ f ate cct tac get 

15 



Met Glu Pro Lys Thr Lys Lys Gin Arg Ser Z^e^ lie P^o 5r IL 

50 20 25 



riZ I ''''^ '^''^ "9 aaa ggc agt gcc ttc 

Gly Pro Val Leu Leu Glu Phe Pro Leu Leu Asn Lys Sly sir Sa pJe 
30 35 40 

Ser Met Su Sin 1^'" "a ctg ccg gaa 

Ser Met Glu Glu Arg Arg Asn Phe Asn Leu Leu Gly Leu Leu Pro Glu 



50 



55 



gtg gtc gaa acc ate gaa gaa caa gcg gaa cga gca tgg ate eag tat 



5106 



<400> 11 

aattcttaag aaggagatat acat atg gat att eaa aaa aga gtg agt gac 51 
45 Asp lie Gin Lys Arg Val Ser Asp 



99 



147 



195 



243 
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Val Val Glu Thr lie Glu Glu Gin Ala Glu Arg Ala Trp lie Gin Tyr 
60 65 70 

cag gga ttc aaa acc gaa ate gac aaa cac ate tac ctg cgt aac ate 
Gin Gly Phe Lys Thr Glu lie Asp Lys His lie Tyr Leu Arg Asn lie 
75 80 85 

cag gac act aac gaa acc etc ttc tac cgt ctg gta aac aat cat ctt 
Gin Asp Thr Asn Glu Thr Leu Phe Tyr Arg Leu Val Asn Asn His Leu 
90 95 100 105 

gat gag atg atg ect gtt att tat acc cea acc gtc ggc gea gcc tgt 
Asp Glu Met Met Pro Val lie Tyr Thr Pro Thr Val Gly Ala Ala Cys 
110 115 120 

gag cgt ttt tct gag ate tac cgc cgt tea egc ggc gtg ttt ate tct 
Glu Arg Phe Ser Glu lie Tyr Arg Arg Ser Arg Gly Val Phe lie Ser 
125 130 135 

tac cag aac egg cac aat atg gac gat att ctg caa aac gtg ceg aac 
Tyr Gin Asn Arg His Asn Met Asp Asp lie Leu Gin Asn Val Pro Asn 
140 145 150 

cat aat att aaa gtg att gtg gtg act gac ggt gaa cgc att ctg ggg 
His Asn lie Lys Val lie Val Val Thr Asp Gly Glu Arg He Leu Gly 
155 160 165 

ctt ggt gac cag ggc ate ggc ggg atg ggc att ccg ate ggt aaa ctg 
Leu Gly Asp Gin Gly He Gly Gly Met Gly He Pro He Gly Lys Leu 
170 175 180 185 

teg etc tat acc gcc tgt ggc ggc ate age ccg gcg tat acc ctt ccg 
Ser Leu Tyr Thr Ala Cys Gly Gly He Ser Pro Ala Tyr Thr Leu Pro 
190 195 200 

gtg gtg ctg gat gtc gga aeg aac aac caa cag ctg ctt aac gat ccg 
Val Val Leu Asp Val Gly Thr Asn Asn Gin Gin Leu Leu Asn Asp Pro 
205 210 215 

ctg tat atg ggc tgg cgt aat ccg cgt ate act gac gac gaa tac tat 
Leu Tyr Met Gly Trp Arg Asn Pro Arg He Thr Asp Asp Glu Tyr Tyr 
220 225 230 

gaa ttc gtt gat gaa ttt ate cag get gtg aaa caa egc tgg cea gac 
Glu Phe Val Asp Glu Phe He Gin Ala Val Lys Gin Arg Trp Pro Asp 
235 240 245 

gtg ctg ttg cag ttt gaa gac ttt get caa aaa aat gcg atg ccg tta 
Val Leu Leu Gin Phe Glu Asp Phe Ala Gin Lys Asn Ala Met Pro Leu 
250 255 260 265 

ctt aac cgc tat cgc aat gaa att tgt tct ttt aac gat gac att cag 
Leu Asn Arg Tyr Arg Asn Glu He Cys Ser Phe Asn Asp Asp He Gin 
270 275 280 

ggc act gcg gcg gta aca gtc ggc aca ctg ate gea gea age cgc gcg 
Gly Thr Ala Ala Val Thr Val Gly Thr Leu He Ala Ala Ser Arg Ala 
285 290 295 
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gca ggt ggt cag tta age gag aaa aaa ate gtc ttc ctt ggc gca ggt 
Ala Gly Gly Gin Leu Ser Glu Lys Lys lie Val Phe Leu Gly Ala Gly 
300 305 310 

tea geg gga tgc ggc att gee gaa atg ate ate tec cag acc cag cgc 
Ser Ala Gly Cys Gly He Ala Glu Met He He Ser Gin Thr Gin Arg 
315 320 325 

gaa gga tta age gag gaa gcg gcg egg cag aaa gtc ttt atg gtc gat 
Glu Gly Leu Ser Glu Glu Ala Ala Arg Gin Lys Val Phe Met Val Asp 
330 335 340 345 

cgc ttt ggc ttg ctg act gac aag atg ccg aac ctg ctg cet ttc eag 
Arg Phe Gly Leu Leu Thr Asp Lys Met Pro Asn Leu Leu Pro Phe Gin 
350 355 360 

aec aaa ctg gtg cag aag cgc gaa aac etc agt gac tgg gat acc gae 
Thr Lys Leu Val Gin Lys Arg Glu Asn Leu Ser Asp Trp Asp Thr Asp 
365 370 375 

age gat gtg ctg tea ctg ctg gat gtg gtg cgc aat gta aaa cca gat 
Ser Asp Val Leu Ser Leu Leu Asp Val Val Arg Asn Val Lys Pro Asp 
380 385 390 

att ctg att ggc gtc tea gga cag acc ggg ctg ttt aeg gaa gag ate 
He Leu He Gly Val Ser Gly Gin Thr Gly Leu Phe Thr Glu Glu He 
395 400 405 

ate cgt gag atg cat aaa cac tgt ccg cgt ccg ate gtg atg ccg ctg 
He Arg Glu Met His Lys His Cys Pro Arg Pro He Val Met Pro Leu 

415 420 425 

tet aac ccg aeg tea cgc gtg gaa gcc aca ccg cag gac att ate gcc 
Ser Asn Pro Thr Ser Arg Val Glu Ala Thr Pro Gin Asp He He Ala 
430 435 

tgg acc gaa ggt aac gcg ctg gtc gcc aeg ggc age ccg ttt aat cca 
Trp Thr Glu Gly Asn Ala Leu Val Ala Thr Gly Ser Pro Phe Asn Pro 
445 450 455 

gtg gta tgg aaa gat aaa ate tae cet ate gcc cag tgt aac aac gcc 
Val Val Trp Lys Asp Lys He Tyr Pro He Ala Gin Cys Asn Asn Ala 



460 465 



470 



ttt att ttc ccg ggc ate ggc ctg ggt gtt att get tec ggc gcg tea 
Phe He Phe Pro Gly He Gly Leu Gly Val He Ala Ser Gly Ala Ser 
475 480 485 

cgt ate ace gat gag atg ctg atg teg gca agt gaa aeg ctg gcg cag 
Arg He Thr Asp Glu Met Leu Met Ser Ala Ser Glu Thr Leu Ala Gin 
490 495 500 505 

tat tea cca ttg gtg ctg aac ggc gaa ggt atg gta ctg ccg gaa ctg 
Tyr Ser Pro Leu Val Leu Asn Gly Glu Gly Met Val Leu Pro Glu Leu 
510 515 520 

aaa gat att cag aaa gtc tec cgc gca att gcg ttt gcg gtt ggc aaa 
Lys Asp He Gin Lys Val Ser Arg Ala He Ala Phe Ala Val Gly Lys 
525 530 535 
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10 



20 



50 



atg gcg cag cag caa ggc gtg gcg gtg aaa acc tct gcc gaa gcc ctg 1683 
Met Ala Gin Gin Gin Gly Val Ala Val Lys Thr Ser Ala Glu Ala Leu 
540 545 550 

caa cag gcc att gac gat aat ttc tgg caa gcc gaa tac cgc gac tac 1731 
Gin Gin Ala lie Asp Asp Asn Phe Trp Gin Ala Glu Tyr Arg Asp Tyr 
555 560 565 . 

cgc cgt acc tec ate taa aagcttatcg atgataagct gtcaaacatg 1779 
Arg Arg Thr Ser lie 
570 

agaattacaa cttatatcgt atggggctga cttcaggtgc tacatttgaa gagataaatt 1839 
gcactgaaat ctagaaatat tttatctgat taataagatg atcttcttga gatcgttttg 1899 
gtctgcgcgt aatctcttgc tctgaaaacg aaaaaaccgc cttgcagggc ggtttttcga 1959 
aggttctctg agctaccaac tctttgaacc gaggtaactg gcttggagga gcgcagtcac 2019 
caaaacttgt cctttcagtt tagccttaac cggcgcatga cttcaagact aactcctcta 2 079 
aatcaattac cagtggctgc tgccagtggt gcttttgcat gtctttccgg gttggactca 2139 
agacgatagt taccggataa ggcgcagcgg tcggactgaa cggggggttc gtgcatacag 2199 
tccagcttgg agcgaactgc ctacccggaa ctgagtgtca ggcgtggaat gagacaaacg 2259 
30 cggccataac agcggaatga caccggtaaa ccgaaaggca ggaacaggag agcgcacgag 2319 
ggagccgcca ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccaccactga 
tttgagcgtc agatttcgtg atgcttgtca ggggggcgga gcctatggaa aaacggcttt 
gccgcggccc tctcacttcc ctgttaagta tcttcctggc atcttccagg aaatctccgc 2499 
cccgttcgta agccatttcc gctcgccgca gtcgaacgac cgagcgtagc gagtcagtga 2559 
40 gcgaggaagc ggaatatatc ctgtatcaca tattctgctg acgcaccggt gcagcctttt 
ttctcctgcc acatgaagca cttcactgac accctcatca gtgccaacat agtaagccag 
tatacactcc gctagcgctg atgtccggcg gtgcttttgc cgttacgcac caccccgtca 
gtagctgaac aggagggaca gctgatagaa acagaagcca ctggagcacc tcaaaaacac 
catcatacac taaatcagta agttggcagc atcacccgac gcactttgcg ccgaataaat 2859 
acctgtgacg gaagatcact tcgcagaata aataaatcct ggtgtccctg ttgataccgg 2919 
gaagccctgg gccaactttt ggcgaaaatg agacgttgat cggcacgtaa gaggttccaa 2979 
ctttcaccat aatgaaataa gatcactacc gggcgtattt tttgagttat cgagattttc 3039 
aggagctaag gaagctaaaa tggagaaaaa aatcactgga tataccaccg ttgatatatc 3099 
ccaatggcat cgtaaagaac attttgaggc atttcagtca gttgctcaat gtacctataa 



2379 
2439 



2619 
2679 
2739 
2799 



3159 
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ccagaccgtt cagctggata ttacggcctt tttaaagacc gtaaagaaaa ataagcacaa 3219 

gttttatccg gcctttattc acattcttgc ccgcctgatg aatgctcatc cggaattccg 3279 

tatggcaatg aaagacggtg agctggtgat atgggatagt gttcaccctt gttacaccgt 3339 

tttccatgag caaactgaaa cgttttcatc gctctggagt gaataccacg acgatttccg 3399 

gcagtttcta cacatatatt cgcaagatgt ggcgtgttac ggtgaaaacc tggcctattt 3459 

ccctaaaggg tttattgaga atatgttttt cgtctcagcc aatccctggg tgagtttcac 3519 

cagttttgat ttaaacgtgg ccaatatgga caacttcttc gcccccgttt tcaccatggg 3579 

caaatattat acgcaaggcg acaaggtgct gatgccgctg gcgattcagg ttcatcatgc 3 63 9 

cgtctgtgat ggcttccatg tcggcagaat gcttaatgaa ttacaacagt actgcgatga 3699 

gtggcagggc ggggcgtaat ttttttaagg cagttattgg tgcccttaaa cgcctggtgc 3759 

tacgcctgaa taagtgataa taagcggatg aatggcagaa attcgaaagc aaattcgacc 3819 

cggtcgtcgg ttcagggcag ggtcgttaaa tagccgctta tgtctattgc tggtttaccg 3879 

gtttattgac taccggaagc agtgtgaccg tgtgcttctc aaatgcctga ggccagtttg 3939 

ctcaggctct ccccgtggag gtaataattg acgatatgat catttattct gcctcccaga 3999 

gcctgataaa aacggttagc gcttcgttaa tacagatgta ggtgttccac agggtagcca 4059 

gcagcatcct gcgatgcaga tccggaacat aatggtgcag ggcgcttgtt tcggcgtggg 4119 

tatggtggca ggccccgtgg ccgggggact gttgggcgct gccggcacct gtcctacgag 4179 

ttgcatgata aagaagacag tcataagtgc ggcgacgata gtcatgcccc gcgcccaccg 4239 

gaaggagcta ccggacagcg gtgcggactg ttgtaactca gaataagaaa tgaggccgct 4299 

catggcgttg actctcagtc atagtatcgt ggtatcaccg gttggttcca ctctctgttg 4359 

cgggcaactt cagcagcacg taggggactt ccgcgtttcc agactttacg aaacacggaa 4419 

accgaagacc attcatgttg ttgctcaggt cgcagacgtt ttgcagcagc agtcgcttca 4479 

cgttcgctcg cgtatcggtg attcattctg ctaaccagta aggcaacccc gccagcctag 4539 

ccgggtcctc aacgacagga gcacgatcat gcgcacccgt ggccaggacc caacgctgcc 4599 

cgagatgcgc cgcgtgcggc tgctggagat ggcggacgcg atggatatgt tctgccaagg 4659 

gttggtttgc gcattcacag ttctccgcaa gaattgattg gctccaattc ttggagtggt 4719 

gaatccgtta gcgaggtgcc gccggcttcc attcaggtcg aggtggcccg gctccatgca 4779 

ccgcgacgca acgcggggag gcagacaagg tatagggcgg cgcctacaat ccatgccaac 4839 

ccgttccatg tgctcgccga ggcggcataa atcgccgtga cgatcagcgg tccagtgatc 4899 

gaagttaggc tggtaagagc cgcgagcgat ccttgaagct gtccctgatg gtcgtcatct 4959 
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10 



acctgcctgg acagcatggc ctgcaacgcg ggcatcccga tgccgccgga agcgagaaga 5019 
atcataatgg ggaaggccat ccagcctcgc gtcgcgaacg ccagcaagac gtagcccagc 5079 
gcgtcggccg ccatgccggc gataatggcc tgcttctcgc cgaaacgttt ggtggcggga 5139 
ccagtgacga aggcttgagc gagggcgtgc aagattccga ataccgcaag cgacaggccg 
atcatcgtcg cgctccagcg aaagcggtcc tcgccgaaaa tgacccagag cgctgccggc 
acctgtccta cgagttgcat gataaagaag acagtcataa gtgcggcgac gatagtcatg 
ccccgcgccc accggaagga gctgactggg ttgaaggctc tcaagggcat cggtcgacgc 
tctcccttat gcgactcctg cattaggaag cagcccagta gtaggttgag gccgttgagc 
accgccgccg caaggaatgg tgcatgcatc gatcaccaca attcagcaaa ttgtgaacat 
catcacgttc atctttccct ggttgccaat ggcccatttt cctgtcagta acgagaaggt 
cgcgaattca ggcgcttttt agactggtcg taatgaac 



20 



25 <210> 12 

<211> 574 

<212> PRT 

<213> Unknown 

30 <220> 

<223> Plasmid 



35 



40 



<400> 12 

Met ASP lie Gin Lys Arg Val Ser Asp Met Glu Pro Lys Thr Lys Lys 



10 



15 

Gin Arg Ser Leu Tyr He Pro Tyr Ala Gly Pro Val Leu Leu Glu Phe 



25 



30 



50 



Pro Leu Leu Asn Lys Gly Ser Ala Phe Ser Met Glu Glu Arg Arg Asn 

^0 45 

Phe Asn Leu Leu Gly Leu Leu Pro Glu Val Val Glu Thr He Glu Glu 

60 

Gin Ala Glu Arg Ala Trp He Gin ^r Gin Gly Phe Lys Thr Glu He 

75 80 

55 ASP Lys His He Tyr Leu Arg Asn He Gin Asp Thr Asn Glu Thr Leu 

90 95 

Phe TVr Arg Leu Val Asn Asn His Leu Asp Glu Met Met Pro Val He 



5199 
5259 
5319 
5379 
5439 
5499 
5559 
5597 
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100 



110 



^ lyr Thr Pro Thr Val Gly Ala Ala Cys Glu Arg Phe Ser Glu He Tyr 
Arg Arg Ser Arg Gly Val Phe He Ser Tyr Gin Asn Arg His Asn Met 

10 

ASP ASP He Leu Gin Asn Val Pro Asn His Asn He Lys Val He Val 



150 



155 



15 



160 



val Thr ASP Gly Glu Arg He Leu Gly Leu Gly Asp Gin Gly He Gly 

170 175 

2 0 Gly Met Gly lie Pro He Gly Lys Leu Ser Leu ^r Thr Ala Cys Gly 

190 

25 ^'"^ tts """^ ASP val Gly Thr 

200 205 

As„ js„ Gl„ 01„ Leu L.U J.„ ^.p p„ ^ 

30 ^ 220 

pro Arg He Thr Asp Asp Glu Tyr Tyr Glu Phe Val Asp Glu Phe He 

240 

35 

Gin Ala val Lys Gin Arg Trp Pro Asp Val Leu Leu Gin Phe Glu Asp 

250 255 

40 Phe M. Oin ^„ ^. ^ 

265 270 

lie cys ser Phe Asn Asp Asp He Gin Gly Thr Ala Ala Val Thr Val 

ZHO 285 

Oly Thr Leu Xle Ala Ala Ser Arg Ala Ala Gly Gly Gin Leu Ser Glu 

50 " 300 

Lys .y. He Vel Phe Leu oly sly Ser Al. OXy Cys Oly He Ala 



55 

Glu Met He He Ser Gin Thr Gin Arg Glu Gly 



320 



--- .... Arg i^iu Gly Leu Ser Glu Glu Ala 

330 
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Ala Arg Gin Lys Val Phe Met Val Arg Phe Gly Leu Leu Thr Asp 

5 Lys Met Pro Asn Leu Leu Pro Phe Gin Thr Lys Leu Val Gin Lys Arg 

365 

10 sTo ''"'^ J,^? val Leu Ser Leu Leu 

380 

ASP val val Arg Asn Val Lys Pro Asp He Leu He Gly Val Ser Gly 



15 



20 



400 



Gin Thr Gly Leu Phe Thr Glu Glu He He Arg Glu Met His Lys His 



410 



415 



cy. P„ p„ V.1 Met P„ s.r ^s. .ro se. v,l 

25 01. .la ™, P.O ol„ ..p 



445 



30 ^'^ ^^^^-^ JJI V.1 val .^a Asp Ha 

460 



35 



40 



Tyr Pro Xle Ala Gin Cys Asn Asn Ala Phe He Phe Pro Gly He Gly 



475 



480 



Leu aly val Ha .la Se. aiy «. .a. xla Th. .ap Olu Hat .au 



490 



495 



Val Ser 



«et =a. «. s„ Clu Th, .a„ .1, .i ^a. P„ .au Val .a„ 

« =ly Glu 01^ »at val r.au P„ jlu Lau .y. A.p xle Gl„ .ya 

525 

50 Jso "^'^ ^-t: Ala Gin Gin Gin Gly Val 

^■^^ 540 

Ala val sa, jl. olu .au «1„ ol„ Ma Ha A.p Aap .an 



555 



Phe Trp Gin Ala Glu Tyr Arg Asp Tyr Arg Arg Thr Ser He 



560 



